FairDistillation: Mitigating Stereotyping in Language Models

نویسندگان

چکیده

Large pre-trained language models are successfully being used in a variety of tasks, across many languages. With this ever-increasing usage, the risk harmful side effects also rises, for example by reproducing and reinforcing stereotypes. However, detecting mitigating these harms is difficult to do general becomes computationally expensive when tackling multiple languages or considering different biases. To address this, we present FairDistillation: cross-lingual method based on knowledge distillation construct smaller while controlling specific We found that our does not negatively affect downstream performance most tasks mitigates stereotyping representational harms. demonstrate FairDistillation can create fairer at considerably lower cost than alternative approaches.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Can Gender-Fair Language Reduce Gender Stereotyping and Discrimination?

Gender-fair language (GFL) aims at reducing gender stereotyping and discrimination. Two principle strategies have been employed to make languages gender-fair and to treat women and men symmetrically: neutralization and feminization. Neutralization is achieved, for example, by replacing male-masculine forms (policeman) with gender-unmarked forms (police officer), whereas feminization relies on t...

متن کامل

‏‎interpersonal function of language in subtitling

‏‎translation as a comunicative process is always said to be associated with various aspects of meaning loss or gain. subtitling as a mode of translating, due to special discoursal and textual conditions imposed upon it, is believed to be an obvious case of this loss or gain. presenting the spoken sound track of a film in writing and synchronizing the perception of this text by the viewers with...

15 صفحه اول

Mitigating Spreadsheet Risk in Complex Multi-Dimensional Models in Excel

Microsoft Excel is the most ubiquitous analytical tool ever built. Companies around the world leverage it for its power, flexibility and ease of use. However, spreadsheets are manually intensive and prone to error, making it difficult for companies to control spreadsheet risk. The following solution is designed to mitigate spreadsheet risk for a set of problems commonly addressed in a spreadshe...

متن کامل

Selective self-stereotyping.

In an examination of group members' responses to the threat of negative in-group characterizations, sorority/fraternity members were asked to rate themselves, their own sorority/fraternity, sororities/ fraternities in general, and students in general on attributes that were stereotypic of sororities/ fraternities. Results showed that individuals selectively self-stereotyped-they embraced positi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-26390-3_37